Application of loudness/pitch/timbre decomposition operators to auditory scene analysis

Authors

  • Mototsugu Abe
  • Shigeru Ando
Abstract

We previously proposed [1] nonlinear operators that decompose the changing energy of a sound in the wavelet domain into three orthogonal components: loudness and pitch as coherent changes, and timbre as incoherent change. We showed that these operators can detect discontinuities in a single sound stream with excellent temporal resolution and sensitivity. In this paper, we extend the coherency principle so that it can describe and track the individual coherency of non-overlapping sound streams in the wavelet domain. The extension is realized by Parzen’s non-parametric estimates and Kalman filtering of the loudness change rate and the pitch shift rate. Using this method, we present experiments on extracting the most salient stream from multiple sound streams.
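
The abstract names two generic building blocks, Parzen-window estimation and Kalman filtering of a rate, without giving details. The following is only an illustrative sketch of how such blocks might be combined, not the authors' algorithm: the function names `parzen_mode` and `kalman_track`, the Gaussian kernel, the random-walk state model, the bandwidth, the variances, and the sample data are all assumptions made for this example. For each frame, a Parzen-window density over per-bin rate measurements picks the dominant (most coherent) rate, and a scalar Kalman filter then smooths that rate across frames.

```python
import math

def parzen_mode(samples, bandwidth, grid):
    """Return the grid point where a Gaussian Parzen-window density peaks."""
    def density(x):
        # Unnormalized sum of Gaussian kernels centred on each sample.
        return sum(math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in samples)
    return max(grid, key=density)

def kalman_track(measurements, process_var, meas_var, x0=0.0, p0=1.0):
    """Scalar Kalman filter with a random-walk state; returns filtered values."""
    x, p = x0, p0
    filtered = []
    for z in measurements:
        p = p + process_var      # predict: state unchanged, uncertainty grows
        k = p / (p + meas_var)   # Kalman gain
        x = x + k * (z - x)      # correct the estimate with the measurement
        p = (1.0 - k) * p        # updated estimate variance
        filtered.append(x)
    return filtered

# Hypothetical data: pitch shift rates measured at five wavelet bins per frame.
# Most bins agree on a rate near 0.1; one bin per frame is an outlier.
frames = [
    [0.10, 0.11, 0.09, 0.50, 0.10],
    [0.12, 0.10, 0.11, -0.40, 0.12],
    [0.11, 0.12, 0.10, 0.11, 0.90],
]
grid = [i / 100.0 for i in range(-100, 101)]
modes = [parzen_mode(f, bandwidth=0.05, grid=grid) for f in frames]
tracked = kalman_track(modes, process_var=1e-4, meas_var=1e-2, x0=modes[0])
```

In this sketch the density mode ignores the outlying bins, which is the sense in which a non-parametric estimate can isolate one coherent group of measurements; the filter then keeps the per-frame estimate from jumping between frames.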

Similar resources

Models of Timbre Using Spectro-temporal Receptive Fields: Investigation of Coding Strategies

Timbre designates all of the perceptual characteristics of sounds that cannot be described as pitch, loudness or duration. Behavioral experiments combined with multidimensional scaling techniques have proposed that a few main acoustic dimensions subserve the perception of timbre for homogeneous ensembles of sounds (e.g., Western musical instrument sounds). It is unclear, however, whether these dimen...

Human Echolocation in Static Situations: Auditory Models of Detection Thresholds for Distance, Pitch, Loudness and Timbre

We investigated, by using auditory models, how three perceptual parameters, loudness, pitch and sharpness, determine human echolocation. We used acoustic recordings from two previous studies, both from stationary situations, and their resulting perceptual data as input to our analysis. An initial analysis was on the room acoustics of the recordings. The parameters of interest were sound pressur...

Influence of pitch, loudness, and timbre on the perception of instrument dynamics.

The effect of variations in pitch, loudness, and timbre on the perception of the dynamics of isolated instrumental tones is investigated. A full factorial design was used in a listening experiment. The subjects were asked to indicate the perceived dynamics of each stimulus on a scale from pianissimo to fortissimo. Statistical analysis showed that for the instruments included (i.e., clarinet, fl...

Nonlinear time-frequency domain operators for decomposing sounds into loudness, pitch and timbre

In this paper, we propose a method for decomposing instantaneous changes of sounds into three energy components, i.e., loudness, pitch, and timbre. These operators are derived from an eigenstructure analysis of the time-frequency gradient space (a 3-D space spanned by a modulus and partial derivatives of a wavelet transform). By several experiments, we found that they have superior resolution a...

Perceived Match between Visual Parameters and Auditory Correlates: an Experimental Multimedia Investigation

This paper investigates the relationship between the auditory and visual components of an audio-visual (A-V) composite. Participants (N=28) were asked to rate the perceived degree of “match” between A-V components in a series of randomly presented composites. Manipulated audio parameters included pitch, loudness, timbre, and duration, while visual parameters included color, vertical location, s...

Publication date: 1996